Isolation of full-length cDNA clones and capped mRNA.



A protein comprising at least a first functional
site having the ability to bind the cap structure of
mRNA and a second functional site having the ability
to bind a solid support matrix in such a manner as to
allow the first functional site to be immobilized and
still remain functionally accessible to interact with
the cap structure of mRNA. Also within the scope of
the present invention is a method for generating a
cDNA library mostly containing full-length cDNAs.
The method comprises the incubation of a mixture
comprising mRNA:cDNA hybrids with 1) a single-strand
RNA specific nuclease and 2) the above-mentioned
protein. The resulting mixture is then
passed through a column comprising a support matrix
having the ability to bind the second functional
site of the above-mentioned protein in order to
selectively bind complete mRNA:cDNA hybrids. The
mRNA:cDNA hybrids are then competitively eluted
with a cap analog and full-length cDNA strands are
separated and recovered. The present invention also
includes a method for purifying capped mRNA using
the above-mentioned protein. The process comprises
the incubation of a mixture containing mRNA
with the above-mentioned protein, passing the resulting
mixture through a column comprising the support
matrix having the ability to bind to the second
functional site of the above-mentioned protein in order to
selectively bind capped mRNAs, and competitively
eluting the capped mRNAs with a cap analog.



BACKGROUND OF THE INVENTION 

Complementary DNA contains the information
coding for the synthesis of proteins. The generation
of complementary DNA (cDNA) libraries is
one of the most fundamental procedures in
contemporary molecular biology. Research involving
the use of cDNA libraries has already led to signifi- 
cant breakthroughs in our understanding of cancer, 
AIDS and numerous other medical concerns. Con- 
sequently, there is a rapidly expanding commercial 
interest in this procedure because of its enormous 
current and future potential applicability. For exam- 
ple, a growing number of companies are marketing 
"ready made" cDNA libraries or kits which simplify 
the task of preparing a cDNA library. 

While the procedures for generating cDNA li- 
braries are being continuously modified and im- 
proved, there are serious drawbacks in the current 
methods that have not been adequately addressed. 
As a result, cDNA cloning is generally inefficient,
making it both cumbersome and, most unfortunately,
very time consuming.

In standard methods currently used for the
preparation of cDNA libraries, the mRNA in the cell
is isolated by virtue of the polyadenylated tail
present at its 3' end, which binds to a resin
specific for this structure (oligo(dT)
chromatography). The purified mRNA is then
copied into cDNA using the enzyme reverse
transcriptase, which starts at the 3' end of the mRNA
and proceeds towards the 5' end. Second strand
synthesis is then performed. Linkers are added to
the ends of the double stranded cDNA to allow for
its packaging into virus or cloning into plasmids. At
this stage, the cDNA is in a form that can be propagated,
the sum of which is termed the cDNA library.

Unfortunately, the major problem with the current
technology is that the majority of the cDNAs
present in any given library are not full-length,
because in most cases the reverse transcriptase
enzyme does not make a complete copy of
the mRNA. This creates serious problems,
especially since the efficiency of copying is
inversely proportional to the length of the mRNA.
As a result, a cDNA library typically contains an
overabundance of incomplete pieces.

Hence, an incomplete or non full-length cDNA
usually does not have the entire genetic blueprint
required to make a functional protein and is
therefore of limited scientific value. Usually, investigators
must perform many rounds of isolation (screenings)
and construct a "full-length" cDNA from the
accumulated pieces. Consequently, valuable time and
scientific resources are lost. The problem
becomes even more acute when long cDNAs
are sought. Additionally, some fragments of the
desired cDNAs might be so underrepresented in
the library that it may be impractical to identify and
isolate all the required segments.

Furthermore, in cDNA libraries produced by
conventional methods, sequences close to the 5' end
of mRNAs are severely under-represented, since the
reverse transcriptase will usually "fall off" before
reaching these sequences. This is unfortunate since
there is a growing interest in isolating these 5'
proximal sequences, in light of recent studies
pointing to the importance of such
sequences in regulating gene expression.

Another problem concerning cDNA synthesis is
the source and quality of the mRNA used. Using
present day technology, the mRNA that is used as
a source for cDNA synthesis is purified by its 3'
end polyadenylated tail. However, some mRNAs do
not possess a 3' polyadenylated tail, whereas all
mRNAs have a 5' cap structure. Consequently, a cDNA
library constructed from mRNA purified by its cap
structure would be more representative of the total
genetic information present in the cell. In recent
years, unsuccessful attempts have been made to
develop antibodies directed against the cap
structure of mRNA. The problems usually encountered
were related to the insufficient affinity of the
antibodies for the cap. This major drawback made it
impossible to develop isolation protocols for
capped mRNAs.

Therefore, it would be highly desirable to
develop a method that would increase the ability of
scientists to isolate both full-length cDNA clones
and capped mRNA.

SUMMARY OF THE INVENTION

In accordance with the present invention, there
is provided a protein useful for the preparation of
cDNA libraries mostly containing full-length cDNA
clones. The protein can also be used for the
isolation of capped mRNA. The protein of the present
invention is a multifunctional protein comprising at
least two functional sites. The first functional site
has the ability to bind the cap structure of mRNA
and the second functional site has the ability to
bind a solid support matrix in such a manner as to
allow said first functional site to be immobilized
and still remain functionally accessible to interact
with the cap structure of mRNA. Preferably, a
protein of the present invention is a bifunctional fusion
protein having one functional site that has the
ability to bind the cap structure of mRNA from
eucaryotic cells and another functional site having
the ability to bind to a solid support matrix.

Also within the scope of the present invention
is a method for generating a cDNA library mostly
containing full-length cDNAs. This method
comprises a first step in which a mixture comprising
mRNA:cDNA hybrids is incubated with

1) a single-strand RNA specific nuclease, and

2) a multifunctional protein comprising at
least a first functional site having the ability to bind
the cap structure of mRNA and a second functional
site having the ability to bind a solid support matrix.

The mixture is then passed through a column
comprising a support matrix having the ability to
bind to the second functional site of the protein in
order to selectively bind complete mRNA:cDNA
hybrids to the matrix. The mRNA:cDNA hybrids are
then competitively eluted with a cap analog, and the
full-length cDNA strands are then separated and
recovered. Preferably, the single-strand RNA
specific nuclease that is used for incubating the
mRNA:cDNA hybrids mixture is T1 nuclease,
whereas the preferred cap analog is m7GDP.

Also within the scope of the present invention
is a method for purifying capped mRNA. This
method comprises the incubation of a mixture
comprising mRNA with a protein having a first
functional site which has the ability to bind the cap
structure of mRNA and a second functional site
having the ability to bind a solid support matrix.

This mixture is then passed through a column
comprising a support matrix having the ability to
bind to the second functional site of the protein in
order to selectively bind capped mRNAs to the
matrix. The capped mRNAs are then competitively
eluted with a cap analog such as m7GDP and thus
capped mRNAs are separated and recovered.

In a preferred embodiment of the present
invention, the protein used for generating both cDNA
libraries containing full-length cDNAs and pure
capped mRNAs is a bifunctional protein, preferably
a fusion protein of the type protein A/eIF-4E fusion
protein.

Finally, the present invention also includes a
resin for the purification of proteins having a
functional cap binding site, said resin comprising an
oxidized cap analog covalently attached to a solid
support matrix. Also included is a method for the
preparation of the resin for the purification of
proteins having a functional cap binding site, said
method comprising oxidizing a cap-analog to yield a
reactive dialdehyde and covalently attaching said
oxidized cap-analog to a solid support matrix.



Therefore, the product of the present invention
will allow, through its selective binding of capped
mRNA, an improvement in the quality of cDNA
libraries that, in return, will allow the identification
of important genes that are not part of present day
cDNA libraries. Furthermore, the product of the
present invention can be used to purify capped
mRNAs selectively in a reproducible manner.

Other advantages of the present invention will
be readily illustrated by referring to the following
description.



IN THE DRAWING

Figure 1 represents the pRIT2T/eIF-4E plasmid.



DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to a novel protein
useful in the construction of full-length cDNA libraries
and the isolation of full-length cDNAs.

Essentially, the product of the present invention
has to be at least bifunctional in that it must have
the ability to bind the cap structure of mRNA while
also having the ability to bind a solid support
matrix so that the cap binding portion of this
protein can be immobilized and still remain
functionally accessible to interact with the mRNA cap
structure. The resulting product is a multifunctional
protein that has the ability to purify capped

[...] solid support. However, it is to be understood that,
in the context of the present invention, it is
useful to consider that one of the [...]
steps in cDNA synthesis results in an mRNA:cDNA
hybrid. When this hybrid is obtained, it is [...]
not complete, then the portion of the mRNA [...]
[...]ably lead to [...]
occur when using the fusion protein of the present
invention that can bind cap structures because only
the full-length mRNA:cDNA hybrids will possess a
5' cap structure. The resulting cDNA library will
then contain full-length clones only and represent an
ideal library for cDNA cloning.

Cap structure and cap binding protein 

All eucaryotic cellular mRNAs that
have been analyzed to date
have a structural modification at their 5' end
termed the cap structure or "cap", which consists
of the structure m7GpppX, where X can be any
nucleotide. Certain proteins or protein portions
have the ability to bind the cap structure and are
termed cap binding proteins (CBPs). Thus, if an
mRNA has a cap structure, then the cap binding
protein will specifically bind the cap in a
non-covalent fashion. The affinity of the protein for the
cap structure is high, readily allowing the specific
retention of capped RNAs as opposed to uncapped
RNAs.

The product of the present invention requires a 
portion having the ability to bind the cap structure 
of mRNA. 

Preferably, a 24 kDa cap binding protein
(CBP), which is known as eucaryotic initiation factor
4E (eIF-4E), may be used in the context of the
present invention. This protein is found in the
cytoplasm of all eucaryotic cells including animal, 
plant and yeast. However, it is to be understood 
that any protein or protein portion that can specifi- 
cally bind the cap structure of mRNA can be con- 
sidered as being a useful part of the present inven- 
tion. 



Solid support matrix binding proteins 

The second essential feature of the product of 
the present invention is that it must possess a 
portion having an affinity for molecules that could 
be bound to a solid support matrix. However, the 
product must be attached to the support matrix in 
such a manner as to allow the cap binding portion 
to interact with the cap structure of mRNAs. 

For example, staphylococcal protein A that has 
the ability to bind IgG immunoglobulins could be 
used in combination with a resin that has IgG 
antibodies attached to it. It would also be possible
to use β-galactosidase in conjunction with an
anti-β-galactosidase antibody resin. In fact, any protein or
protein portion that could possibly be linked to a 
solid support matrix could be used in the context of 
the present invention. 

Therefore, although the present invention will
highlight the use of a fusion protein containing both
a cap binding protein and a protein having the
ability to bind to a solid support matrix, it is to be
understood that the present invention is not limited
to these types of proteins. In fact, any
multifunctional protein possessing the ability to bind both
cap structures of mRNA and a solid support matrix
could be useful in the context of the present
invention.

Process for obtaining full-length cDNAs

Once a mixture containing mRNA:cDNA
hybrids has been obtained through methods generally
known to those skilled in the art, it is incubated
with a single-strand RNA specific nuclease.
Preferably, T1 nuclease (RNase T1 from Aspergillus
oryzae), an endonuclease that specifically cleaves
the phosphodiester bond on the 3' side of G in GpN,
can be used as the single-strand RNA specific nuclease.
The naturally modified m7G part of the cap
structure will not be recognized by this enzyme. The
use of RNase T1 for probing single-stranded
regions is well documented and widely known to
those skilled in the art. RNase T1 will not attack
RNA that is hybridized to DNA and it is therefore
well suited for the purposes of the present
invention. However, it is to be understood that any
single-strand RNA specific nuclease could also be
used in the context of the present invention.

Thus, if the reverse transcriptase copies the
entire length of the mRNA, or if it falls short by a
few nucleotides such that there is no unhybridized
GpN residue in the corresponding mRNA, RNase
T1, which will only digest unpaired GpN residues,
will not degrade the mRNA and, as a result, the cap
structure will remain covalently bound to the
mRNA:cDNA hybrid. If, however, cDNA synthesis
was not complete, the single-strand RNA specific
nuclease will degrade unpaired RNA and remove
the cap structure from the mRNA:cDNA hybrid.
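
The nuclease rule described above can be sketched in a few lines of Python. This is only an illustration and not part of the original disclosure: the function name `cap_retained`, the representation of the mRNA as a 5' to 3' string without the cap, and the simplifying assumption that every unpaired G is cleaved are all hypothetical.

```python
def cap_retained(mrna: str, cdna_length: int) -> bool:
    """Return True if the cap stays attached to the hybrid after RNase T1 treatment.

    mrna        -- mRNA sequence written 5' to 3'; the m7G cap itself is left out,
                   since the text states that T1 does not recognize it
    cdna_length -- number of mRNA nucleotides, counted from the 3' end, that were
                   copied into cDNA and are therefore protected in the hybrid
    """
    if not 0 <= cdna_length <= len(mrna):
        raise ValueError("cdna_length must be between 0 and len(mrna)")
    # Nucleotides at the 5' end that reverse transcriptase did not reach.
    unpaired = mrna[: len(mrna) - cdna_length].upper()
    # Assumption: RNase T1 cuts at every unpaired G, which releases the cap.
    return "G" not in unpaired


if __name__ == "__main__":
    print(cap_retained("ACUGGCAUCC", 10))  # full-length copy
    print(cap_retained("ACUGGCAUCC", 7))   # 3 nt short, no unpaired G
    print(cap_retained("ACUGGCAUCC", 5))   # unpaired region contains G
```

In this toy model a copy that stops a few nucleotides short still keeps its cap as long as the uncopied 5' stretch contains no G, which mirrors the "near full-length" case mentioned in the text.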

Following nuclease treatment, the mRNA:cDNA
hybrids are incubated with the multifunctional
protein of the present invention. As a result of this
incubation, only those mRNA:cDNA hybrids that
have a covalently attached cap structure will bind
to the protein of the present invention. When
the mixture is applied to a resin having a strong
affinity for the second functional site of the protein
of the present invention, all the hybrids lacking a
cap, i.e. the incomplete cDNAs, will wash through. The bound
full-length capped mRNA:cDNA hybrids will then
be competitively eluted with a cap analog such as
m7GDP.
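
The selection step can be pictured as a simple partition of the hybrids. The sketch below is an idealized model under stated assumptions: the `Hybrid` class and the `affinity_select` function are hypothetical names, and binding is treated as perfect, which a real column would only approximate.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Hybrid:
    name: str
    capped: bool  # True if the cap survived the nuclease treatment


def affinity_select(hybrids: List[Hybrid]) -> Tuple[List[Hybrid], List[Hybrid]]:
    """Split hybrids into (eluted, flow_through).

    Capped hybrids are retained through the immobilized cap-binding protein and
    later released by competition with a cap analog; uncapped hybrids wash through.
    """
    eluted = [h for h in hybrids if h.capped]
    flow_through = [h for h in hybrids if not h.capped]
    return eluted, flow_through


if __name__ == "__main__":
    pool = [Hybrid("full-length", True), Hybrid("truncated", False)]
    eluted, flow_through = affinity_select(pool)
    print([h.name for h in eluted], [h.name for h in flow_through])
```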

The resulting purified fraction contains only full- 
length or near full-length first strand cDNAs which 
then act as templates for second strand synthesis. 




EP 0 373 914 A2 



The steps for completing the cDNA library are the 
same as those normally used by those skilled in 
the art. Essentially, the present invention adds to
standard cDNA preparation procedures a novel step
that discards incomplete cDNAs and selects only
full-length cDNAs for inclusion in the cDNA library.

Affinity resin for purifying cap binding proteins 

The selective purification of cap binding pro- 
teins or fusion proteins with a functional cap bind- 
ing site is most efficiently accomplished by affinity 
chromatography using cap-analogs covalently at- 
tached to a solid support matrix. Although several 
cap-analog resins have been devised, and one is 
presently available from Pharmacia, a new cap- 
analog resin that is less expensive, very rapid, and 
less demanding to prepare than those previously 
reported forms part of the present invention. 

The synthesis of the cap-analog resin is
performed in the following manner. A cap-analog, such
as m7GDP, is oxidized in the presence of periodate
to yield a reactive dialdehyde. Upon incubation of
the oxidized cap-analog with adipic acid
dihydrazide agarose (Pharmacia), a hydrazone bond
is formed. The hydrazone bond is further stabilized
by reductive amination in the presence of sodium
cyanoborohydride (NaBH3CN). This results in a
cap-analog covalently attached (through a spacer)
to a solid support matrix. The binding efficiency is
approximately 90% of the input cap-analog and the
resin is stable for months at 4°C. The procedure
requires minimal steps and all steps are based on
simple chemical reactions.
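
As a rough worked example of the stated binding efficiency, the amount of cap-analog expected on the resin can be estimated from the input. The function name `coupled_cap_analog` and the choice of micromoles as the unit are illustrative assumptions; only the figure of approximately 90% comes from the text.

```python
def coupled_cap_analog(input_umol: float, efficiency: float = 0.90) -> float:
    """Estimate the amount of cap-analog bound to the resin (same unit as the input).

    efficiency -- fraction of the input that couples; the text gives roughly 0.90
    """
    if input_umol < 0 or not 0.0 <= efficiency <= 1.0:
        raise ValueError("input must be non-negative and efficiency within [0, 1]")
    return input_umol * efficiency


if __name__ == "__main__":
    # With 10 micromoles of oxidized m7GDP, about 9 micromoles would be coupled.
    print(round(coupled_cap_analog(10.0), 2))
```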

Affinity purification of capped mRNAs 

Independent of its use in constructing
full-length cDNAs, the protein of the present invention,
when used in combination with a suitable binding
resin, can be used to purify capped mRNAs. In
cDNA synthesis, there are two major advantages of
purifying mRNA by the cap structure rather than
using the conventional poly A tail purification.

First, not all eucaryotic cellular mRNAs have a
poly A tail at their 3' end, whereas all mRNAs
analyzed to date have a 5' cap structure.
Consequently, the source of mRNA purified by the cap
structure is more representative of the entire
spectrum present in the cell.

Secondly, by purifying mRNAs by their cap
structure, it is possible to minimize the percentage
of degraded mRNA molecules that are normally
used as substrates for cDNA synthesis. This
feature is extremely important because one of the
most variable and important criteria in the
generation of a good cDNA library is the quality of the
mRNA that is used. If an mRNA is partially
degraded, it can still be copied by the reverse
transcriptase enzyme as long as there is a 3' poly A
tail, thereby exacerbating the problem of
incomplete cDNA.

However, if mRNA is purified by its cap
structure and it is partially degraded (i.e. 3' sequence
and poly A tail are not present), it will not be a
substrate for oligo(dT) primed reverse transcription.
Only mRNAs which have a cap and a poly A tail
simultaneously will be a substrate for cDNA
synthesis. Invariably, only full-length mRNAs satisfy
this criterion and their use will enhance the quality of
present day cDNA libraries.
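
The reasoning above reduces to a small truth table, sketched here in Python. The function `is_cdna_substrate` and its parameters are hypothetical names introduced for illustration; the model assumes that purification retains exactly the molecules carrying the selected feature.

```python
def is_cdna_substrate(has_cap: bool, has_poly_a: bool, cap_purified: bool) -> bool:
    """Return True if an mRNA would be copied by oligo(dT) primed reverse transcription.

    cap_purified -- True when the mRNA pool was purified by its cap structure,
                    False for conventional poly A tail purification
    """
    # Purification step: which feature the molecule needs in order to be retained.
    retained = has_cap if cap_purified else has_poly_a
    # Priming step: oligo(dT) can only anneal to a poly A tail.
    return retained and has_poly_a


if __name__ == "__main__":
    # An mRNA that lost its 5' end but kept its poly A tail:
    print(is_cdna_substrate(has_cap=False, has_poly_a=True, cap_purified=False))
    print(is_cdna_substrate(has_cap=False, has_poly_a=True, cap_purified=True))
```

Under these assumptions, a 5'-degraded mRNA is still copied after conventional purification but is excluded after cap purification, while a 3'-degraded mRNA is never primed in either case.
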

One must bear in mind that the purification of
capped mRNA is not always related to cDNA synthesis. For
example, the in vitro synthesis of mRNA by using
the SP6 system (Promega-Biotec) is widely used.
However, the ability to generate capped mRNAs is
somewhat variable as it pertains to the efficiency of
capping. Therefore, a mixed population of capped
and uncapped mRNA is synthesized and this
mixture could easily be separated using the system of
the present invention.

The following example is introduced in order to
illustrate rather than limit the scope of the present
invention.

Example 1



Construction of the protein A/eIF-4E fusion protein



In order to produce the bifunctional protein
A/eIF-4E fusion protein, the yeast eIF-4E gene was
fused to staphylococcal protein A by recombinant
DNA technology. The eIF-4E gene was placed in
front of protein A using Pharmacia vector pRIT2T.
This vector allows for the efficient overproduction
of a protein A/eIF-4E fusion protein.



Yeast eIF-4E gene



The yeast eIF-4E gene was isolated using the
method described in Altmann et al., Molecular and
Cellular Biology 7 (1987) p. 998. To create the fusion
protein, the yeast eIF-4E gene was mutated by site
directed mutagenesis in order to obtain a unique
BamHI restriction site at the translation start codon.
The use of BamHI and HindIII enabled the isolation
of the entire coding sequence of eIF-4E except for
the first amino acid, which is lost as a result of
mutagenesis.
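
Locating the two restriction sites in a sequence can be illustrated with a short Python sketch. The recognition sequences GGATCC (BamHI) and AAGCTT (HindIII) are standard; the helper names and the toy sequence are hypothetical and do not represent the actual eIF-4E gene. The sketch reports site positions only and does not model the staggered cuts the enzymes make.

```python
from typing import List

BAMHI = "GGATCC"    # BamHI recognition sequence
HINDIII = "AAGCTT"  # HindIII recognition sequence


def find_sites(seq: str, site: str) -> List[int]:
    """Return the 0-based start position of every occurrence of `site` in `seq`."""
    seq = seq.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


def bamhi_hindiii_fragment(seq: str) -> str:
    """Return the stretch from the first BamHI site through the next HindIII site."""
    seq = seq.upper()
    b = seq.find(BAMHI)
    if b == -1:
        raise ValueError("no BamHI site found")
    h = seq.find(HINDIII, b + len(BAMHI))
    if h == -1:
        raise ValueError("no HindIII site downstream of the BamHI site")
    return seq[b : h + len(HINDIII)]


if __name__ == "__main__":
    toy = "TTGGATCCATGAAACCCAAGCTTGG"  # made-up sequence for illustration only
    print(find_sites(toy, BAMHI), find_sites(toy, HINDIII))
    print(bamhi_hindiii_fragment(toy))
```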
Protein A



Staphylococcal protein A has the ability to bind 
IgG immunoglobulins. Protein A was used because 
the binding constant of protein A to IgG is
remarkably high, thereby minimizing the loss of fusion
protein from the IgG resin. This feature is important
because it allows the purification scheme to be
repeated with the same material, thereby
increasing the cost-effectiveness of the product.
Furthermore, IgG and the resin to which it is covalently
bound are rather cheap, effective and easy to
prepare. Finally, a commercially available gene fusion
vector sold by Pharmacia under the name pRIT2T 
with protein A sequences placed in an appropriate 
location allows for an easy overproduction of pro- 
tein A fusion protein. 



Introduction of eIF-4E into pRIT2T and transformation
of E. coli

The mutated yeast eIF-4E gene described
above is subcloned into the BamHI-HindIII site of
the KS vector (Pharmacia) and subsequently cut with HindIII.
The ends are Klenow repaired, BamHI linkers are
then added, and the desired eIF-4E fragment is cut
with BamHI and isolated using standard
procedures. The pRIT2T vector is then cut with BamHI
and the mutated eIF-4E gene is then ligated to the
pRIT2T vector and transformed into E. coli N4830-1.
The transformation procedures are those generally
used by those skilled in the art. The resulting
transformed E. coli strain was given the designation
A-4E. The plasmid containing the desired eIF-4E
fragment was deposited at the American Type
Culture Collection (ATCC) and given the accession
number 40522.



Expression and isolation of the protein A/eIF-4E
fusion protein

The use of the pRIT2T vector allows for the
efficient temperature-inducible expression of
intracellular fusion proteins in E. coli. Following the
manufacturer's (Pharmacia) procedure, the
transformed E. coli cells are grown to an O.D.600 value
of approximately 1.0 at 30°C. The temperature is
then raised from 30°C to 42°C for 2 hours. The
culture is then sonicated in a buffer containing a
mild detergent and centrifuged at low speed in
order to discard cellular debris. The supernatant
liquid is then centrifuged at high speed in order to
obtain high yields of the fusion protein. This high
speed centrifugation step is not part of the
procedure described by the manufacturer and was
introduced in order to enhance production yields.



The overexpressed protein is then purified to
homogeneity by passing the E. coli extract over a
cap analog affinity resin of the type described
above, such as m7GDP-agarose. Only the fusion
protein binds the cap-analog resin because of its
affinity for caps, and the other contaminating
proteins are removed by washing the affinity resin with
low salt containing buffer.

The bound fusion protein is then specifically
eluted with saturating amounts of a cap analog
such as m7GDP, which competes for cap specific
binding sites on the fusion protein. The excess
m7GDP present with the purified fusion protein is
removed by dialysis to yield the fusion protein that
can bind cap structures. Approximately 2 to 3
milligrams of pure fusion protein can be obtained
for each liter of culture media. The fusion protein
thus obtained has proven to be stable for several
months at 4°C, apart from being easily
overproduced and purified by simple and inexpensive
methods.
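
The stated yield of approximately 2 to 3 milligrams per liter allows a quick estimate of how much culture a given amount of fusion protein requires. The function name `culture_volume_range` is a hypothetical helper; only the yield range comes from the text.

```python
from typing import Tuple


def culture_volume_range(
    target_mg: float, low_yield: float = 2.0, high_yield: float = 3.0
) -> Tuple[float, float]:
    """Return (minimum, maximum) liters of culture needed for `target_mg` of protein.

    low_yield / high_yield -- expected yield bounds in mg of pure protein per liter
    """
    if target_mg < 0 or low_yield <= 0 or high_yield < low_yield:
        raise ValueError("target must be non-negative and 0 < low_yield <= high_yield")
    # The best case uses the high yield, the worst case the low yield.
    return target_mg / high_yield, target_mg / low_yield


if __name__ == "__main__":
    # For 6 mg of fusion protein, between 2 and 3 liters of culture would be needed.
    print(culture_volume_range(6.0))
```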



Solid-support matrix used for immobilization of the
fusion protein

In order to immobilize the fusion protein of the
present invention, it is necessary to use a resin that
has an IgG antibody attached to it. This allows for
the specific retention of the fusion protein through
its protein A portion, thereby allowing the eIF-4E
portion to be free to interact with capped mRNAs.
Resins of that type are presently available
commercially, but it was found that the commercially
available resins, especially those sold by Pharmacia,
were contaminated with nucleases that degrade
mRNA, thereby making it impossible to isolate
good quality mRNA. For the purposes of the
present invention, a resin synthesized using IgG
antibodies obtained from ICN and Affigel-10 resin
from Bio-Rad has been used. The column has
been found to be stable for at least several months
at 4°C.

Claims 

1. A protein comprising at least a first
functional site having the ability to bind the cap
structure of mRNA and a second functional site having
the ability to bind a solid support matrix in such a
manner as to allow said first functional site to be
immobilized and still remain functionally accessible
to interact with the cap structure of mRNA.

2. The protein of Claim 1, which is a bifunctional
protein.

3. The protein of Claim 1, which is a fusion
protein.

4. The protein of Claim 3, which is the protein
A/eIF-4E fusion protein.

5. The protein of Claim 1, wherein the first
functional site has the ability to bind the cap
structure of mRNA from eucaryotic cells.

6. The protein of Claim 1, wherein the first
functional site is the functional site of eIF-4E.

7. The protein of Claim 1, wherein the second
functional site is the functional site of protein A
having an affinity for IgG antibodies.

8. A method for generating a cDNA library
mostly containing full-length cDNAs, said method
comprising:

- incubating a mixture comprising mRNA:cDNA
hybrids with

1) a single-strand RNA specific nuclease, and

2) a protein comprising at least a first
functional site having the ability to bind the cap
structure of mRNA and a second functional site having
the ability to bind a support matrix in such a
manner as to allow said first functional site to be
immobilized and still remain functionally accessible
to interact with the cap structure of mRNA;

- passing said mixture through a column comprising a
support matrix having the ability to bind to said
second functional site of said protein, thereby
selectively binding complete mRNA:cDNA hybrids to
said matrix;

- eluting said complete mRNA:cDNA hybrids with a
cap analog; and

- separating and recovering full length cDNA
strands.

9. The method of Claim 8, wherein said
single-strand RNA specific nuclease is T1 nuclease.

10. The method of Claim 8, wherein said
protein is the protein A/eIF-4E fusion protein.

11. The method of Claim 8, wherein said cap
analog is m7GDP.

12. A method for purifying capped mRNA 
which comprises: 

- incubating a mixture containing mRNA with a 
protein comprising at least a first functional site 
having the ability to bind the cap structure of 
mRNA and a second functional site having the 
ability to bind a solid support matrix in such a 
manner as to allow said first functional site to be 
immobilized and still remain functionally accessible 
to interact with the cap structure of mRNA; 

- incubating said mixture with a support matrix 
having the ability to bind to said second functional 
site of said protein, thereby selectively binding 
capped mRNAs to said matrix; 

- competitively eluting said capped mRNAs with a
cap analog; and

- separating and recovering capped mRNAs.

13. The method of Claim 12, wherein said
protein is the protein A/eIF-4E fusion protein.

14. The method of Claim 12, wherein said cap
analog is m7GDP.



15. A plasmid having the identifying
characteristics of ATCC number 40522.

16. A resin for the purification of proteins
having a functional cap binding site, said resin
comprising an oxidized cap analog covalently attached
to a solid support matrix.

17. A method for the preparation of the resin of 
Claim 16, said method comprising: 

- oxidizing a cap-analog to yield a reactive
dialdehyde; and

- covalently attaching said oxidized cap-analog to a 
solid support matrix. 